Topology Inspired Problems for Cellular Automata, and a 
Counterexample in Topology* 



Ville Salo Ilkka Torma 

TUCS - Turku Center for Computer Science, University of Turku, Finland 

University of Turku, Finland _ . . . 

J iatorm@utu.fi 

vosalo@utu.fi 



We consider two relatively natural topologizations of the set of all cellular automata on a fixed al- 
phabet. The first turns out to be rather pathological, in that the countable space becomes neither 
first-countable nor sequential. Also, reversible automata form a closed set, while surjective ones are 
dense. The second topology, which is induced by a metric, is studied in more detail. Continuity of 
composition (under certain restrictions) and inversion, as well as closedness of the set of surjective 
automata, are proved, and some counterexamples are given. We then generalize this space, in the 
sense that every shift-invariant measure on the configuration space induces a pseudometric on cellu- 
lar automata, and study the properties of these spaces. We also characterize the pseudometric spaces 
using the Besicovitch distance, and show a connection to the first (pathological) space. 



1 Introduction 



Cellular automata are a class of discrete dynamical systems. They are defined on the set of all two- 
directional infinite sequences of symbols from a finite alphabet, and the dynamics are given by a local 
function, which is synchronously applied at each coordinate. Cellular automata have been previously 
studied at least in the contexts of algorithmics, computability and dynamical systems theory J9). The 
set of all cellular automata, or even a large subset of it, has been studied in relatively few articles. 
Usually, such works have been concerned with the algebraic structure of the automorphism group or 
endomorphism monoid of a subshift (HQ] [6]]. 

In the article ifTTl . we show that when the full shift is given a certain natural topology (the Besicovitch 
topology), cellular automata are exactly the continuous functions on it which compute the image of some 
cellular automaton at every point. In particular, the cellular automaton used cannot even be 'changed' 
at, say, unary points of the full shift, where not much information about the cellular automaton is shown 
in the image. The crucial idea in the proof is topologizing the set of all cellular automata, although the 
choice of topology is rather arbitrary. This is the only use of a topology on the space of cellular automata 
we are aware of. Since the space is countable, one might assume that topology is necessarily useless. 
A good counterexample to this intuition is found for instance in @, where the infinitude of primes is 
proved using topology, in a rather beautiful way. 

In hope of finding other uses for topologizing cellular automata, we introduce two naturally arising 
topologies for the set of cellular automata. One of these topologies, denned in terms of pointwise be- 
havior, turns out to be rather pathological, in that it gives a countable space which is not first-countable. 
Some other such examples can be found in lfl2l . but they are all based on a different idea, and have quite 
distinct properties from our space. 
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In addition to the naturally arising pathological example, we define a family of topologies which not 
only arise naturally, but also behave that way. The topologies are given by measures of the configuration 
space, and we call them difference set topologies. The topologies arise from measuring the difference of 
preimages of cylinders, and thus they measure the difference of two CA in one step of evolution. As one 
might guess, it turns out convergence in such a topology does not imply that the limit shares the dynam- 
ical properties of the elements of the converging sequence. However, many 'one-step' properties play 
well with our topology: in the uniform Bernoulli case, which is the most natural representative of the 
family of topologies, we prove that surjective cellular automata are a closed set using the balance prop- 
erty, that inversion is continuous, and that composition is continuous under certain natural restrictions. 
Also, our proofs of perfectness and non-closedness of well-known subspaces use interesting approxima- 
tion results: for instance, we turn the XOR automaton invertible with an arbitrarily small change in its 
local function. This was also one of the key (although also one of the simplest) ideas in the proof of JSJ. 

The paper is organized as follows. Section [2] consists of the definitions and notation used in this 
paper. In Section [3] we briefly consider some topologies on cellular automata which seem promising 
at first, but fail to satisfy some natural constraints. In particular, we show that in a topology defined by 
pointwise behavior, composition is far from continuous, and discuss its other pathological properties. 
In Section |4j we give a combinatorial description of the difference set topology given by the uniform 
Bernoulli measure, and present some of its properties with combinatorial proofs. In Section [5] we give 
the general definition of the difference set topologies for arbitrary shift-invariant measures. We study 
the properties of such topological spaces, and rephrase some of the combinatorial results in terms of 
measures. Section [6] consists of our conclusions and directions for future work. 

2 Definitions 

Let £ be a finite set, called the state set or alphabet, which we assume to always have cardinality greater 
than 1. The set L z of bi-infinite state sequences, or configurations, is called the full shift on E. If x G L z , 
then we denote by x\ the ith coordinate of x, and we adopt the shorthand notation xuj] = JtjXi+i . . .Xj. A 
word is an element of E" for some n, and we write Z* for the set of all words. We define N(r) for the 
interval [— r,r], and call w G E^W a centered word, denoting r(w) = r. We use the notation £^ for the 
set of all centered words. The indexing notations x\ and xus\ are extended to (centered) words, with vo 
being the central coordinate of a word v G £°. For v G 1° and «,w 6 E* with \u\ = \w\, we write uvw 
for the centered word sharing its center with v, and vr/ji for the word v,- • • - vj. For words u,v,w G £*, the 
notation "uvw" defines in a natural way a configuration which is w-periodic to the left and w-periodic to 
the right, with the word v starting at coordinate 0. 

We define a metric dc, called the Cantor metric, on the full shift by 

dc(x,y) = £ 

for all x,y G Z z . This definition makes £ z a compact metric space. We define the shift map a : £ z — > £ z 
by o(x)i = Xi+\. For a word w (centered or not) and n G Z, we define the cylinder of w at n by [w] n = 
{i G I z | x n+ i = Wj for all relevant i}. If W is a set of words, then [W]„ = \J w ew \ w \n- If W is finite, these 
sets are also called cylinders, and they form a clopen (closed and open) basis for the topology of Z z . 

A cellular automaton is a continuous function c : L z — > £ z with the property c o a = a o c. Al- 
ternatively, cellular automata are defined by local functions F : L N ^ — > L for some r G N such that 
c(x)i = F(jtr,-_ r) ,- +r i). Any r that can be chosen for the local function is called a radius of c, and the min- 
imal radius is denoted r(c). We also denote N(c) = N(r(c)). A cellular automaton c, being continuous, 
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defines a dynamical system (£ z ,c). The notation CA stands for the set of all CA on £ z . We denote by 
SUR and REV the sets of surjective and injective cellular automata, respectively. It is known (see [5]) that 
REV C SUR, and that bijective cellular automata are in fact reversible, that is, the inverse function is in 
CA as well. Also, surjective cellular automata are exactly those c G CA that exhibit the so-called balance 
property that |c (w)| = |I| 2r ( c ) for all w G I* with \w\ > 1, when c is considered as a word function 
c : lM+2r(c) ->x) w \. 

For u,v G £°, we denote u -< v if v = sut for some s,t G £*, and u ~ v if uo = vq. We always assume 
T is a permutation of £ without fixed points. When £ = {0, 1}, the permutation a h > a is defined as (0 1) 
in cycle notation. 

We also need some notions for topological spaces and their subsets, mostly in Section [3] A subset 
of a topological space is sequentially closed, if it contains the limits of its converging sequences. A 
topological space X is said to be 

• metrizable, if the topology can be induced by a metric, 

• normal, if any two disjoint closed subsets of X can be separated with open neighborhoods, 

• completely regular, if any closed set F of X and a point x ^ F can be separated by a continuous 
function / : X [0, 1] in the sense that f{x) = 0,f(F) = {1}, 

• sequential, if its sequentially closed subsets of X are exactly the closed sets, 

• of first category, if it is a countable union of nowhere dense sets, 

• totally disconnected, if its connected components are singletons, 

• zero dimensional, if it has a clopen base, 

• first-countable, if every point has a countable neighborhood basis, 

• second-countable, if the topology has a countable base. 

A function / : X — > Y between metric spaces (X,d) and (Y,d') is Lipschitz if d' (f (x) , f (y)) < C • d(x,y) 
for some constant C > 0. In particular, Lipschitz functions are continuous. 

In this article, a measure is always understood as a probability measure on the Borel subsets of Z z . 
A measure pL is shift invariant if n(C) = /x(a(C)) for all Borel sets C C Z z . It is known (see [3]) that 
all measures are regular, meaning that for every Borel set C C I z we have jJ.(C) = inf '{/x(U) \ C C 
U, U open}. For two measures ji. and v, we denote /x » v if jJ.(C) = implies v(C) = 0. 

A shift invariant measure /I is ergodic, if cj(C) = C implies /i(C) G {0, 1}. Examples of ergodic 
measures are the Bernoulli measures jl p , given by a map p : A — > [0, 1] with £ ae A/'( a ) = '> which 

are defined by setting jUp([w]o) = n|=o 1 P(. w d- If A 4 i s ergodic, v is shift invariant and /I » v, then 
H = v. The famous Birkhoff's pointwise ergodic theorem states that if /I is a shift invariant measure and 
/ : E z — > C a /x-integrable function, then 



1 



7! 



/*(*) = lim 



2n + 1 



is defined for /x-almost all x. Also, the function /* is /x-integrable and satisfies 
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3 Problematic Topologies 

When topologizing a set of objects, one usually wishes the topology to be somehow related to the nature 
of these objects, or some structure given to the set itself. In a topological group, for example, the group 
operations (g,h) h-> g-h and g t- > g~ l are required to be continuous. In the case of cellular automata, a 
reasonable requirement is that composition should be continuous, at least with some natural restrictions. 
In order to avoid trivialities, we also require that the space should be nondiscrete and Hausdorff. In 
this section, we briefly discuss some topologies that fail to fulfill these properties, also noting a possibly 
interesting counterexample in topology. 

Definition 1. The pointwise topology on CA has the sets 



indexed by x G Z and a EE, as a subbase. Forx G E ,c G CA, we denote U x (c) = U x (c(x)o). 

This means that a sequence (c,), e N converges to c G CA iff (a(x))i^ converges to c(x) for all x G Z z . 
Equivalently, c ; - — > c if and only if for all x G Z z , we have (c/(jc))o = c(x)o for all large enough i. As 
we will see later, this does not characterize the pointwise topology. In the case |E| =2, this space is a 
kind of 'dual' of the usual Cantor space: The Cantor topology is given on the set of infinite sequences by 
taking the cylinders as a clopen base. In the pointwise topology, we take the cylinders as points (a CA 
corresponding to the cylinder it maps to 1), and take as a subbase the sets of cylinders containing, or not 
containing, a given point. 

The pointwise topology might seem like a very natural choice for topologizing the cellular automata, 
but already the composition operation fails to be continuous: 

Example 1. Composition is not sequentially continuous (and thus not continuous) in the pointwise topol- 
ogy, even in REV with the alphabet {0, 1}. Namely, let i G N, and consider the automata c, and d\ that 
behave as the identity, except that Cj transforms every pattern of the form 1 10'aO 1 1 1 to 1 10'dO'l 1, and dj 
transforms every l+i aaO' \0 l aa0' +i to l+i aa0'\0 l aa0' +i . It is routine to check that these automata are 
reversible (as they have period!), and that in the pointwise topology, both (c,-) and (dj) converge to the 
identity automaton. However, (cjodj) does not, since the central cell of c^di^QlQT)) is always 0. 

Of course, composition from either side with a constant cellular automaton is continuous. 

The pointwise topology has some interesting properties: the reversible cellular automata form a 
closed set, and the set of surjective automata is dense. Note that since we have not proved that this space 
is sequential (and in fact it is not: see Theorem[T]), we cannot prove these propositions using sequences. 

Proposition 1. The set REV is closed in the pointwise topology. 

Proof. Let c be a point in CA — REV. Since a cellular automaton is reversible if and only if it is re- 
versible on periodic points, we have that c(x) = c(y) for some p-periodic points x and y. Now, let 
X p = {z | o p (z) = z], and consider the set U = f\ex U z (c), which is an open neighborhood of c. Then, 



Proposition 2. The set SUR is dense in the pointwise topology. 

Proof. Let c G CA be arbitrary, and let U = f] xeX U x (c) be a neighborhood of c, where X C Z z is finite. 
Let n G N be such that every pair x ^ y G X differs in some coordinate in N(n). We assume that Z = 
{0, ... ,k — 1}, so that we can use addition modulo k on Z. For a centered word w G Z^M, we define 



U x (a) = {c G CA | c(x)o = a} 




□ 
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X(w) G £ as the letter x n+ \, if x N <„\ = w for some x G X, and otherwise. Note that X(w) is well-defined 
by the choice of n. 

We define the automaton d G CA, which has radius n + 1 , by 

d(x) = c(x)o + x n+ \ —X(xN(n)) mod k 

for all x G £ z . We immediately see that d(x)o = c(x)o for all x G X, so that d £ U. Also, J is right- 
permutive, meaning that d(v) ^ d(w) holds whenever v,w G £ A '(' 1 + 1 ) differ only in their rightmost coor- 
dinate n+l. It is clear that such automata are always surjective, so the claim is proved. □ 

We note that whether a set F C CA is dense in another set G C CA in the pointwise topology has a 
natural interpretation even outside our formalism: this is the case if and only if whenever one chooses an 
automaton c G G and a finite set of points X C £ z , some automaton d G F satisfies d(x)o = c(x)o for all 
xGX. 

The pointwise topology has the curious property that, while the underlying set is countable, the space 
is not first-countable. To our knowledge, examples of such spaces are nontrivial to construct. Of the 143 
examples in Ifl2l . only 5 spaces are countable but not first-countable, and we compare them with the 
pointwise CA space in Table [T] 

Theorem 1. The space CA with the pointwise topology has the following properties: 

• It is countable. 

• It is Hausdorff. 

• It is not compact. 

• It is perfect. 

• It is totally disconnected. 

• It is of first category. 

• It is not first-countable, and in fact no point has a countable neighborhood basis. It is thus neither 
metrizable nor second-countable. 

• It is normal. 

• It is not sequential. 

Proof. Countability and the Hausdorff property are trivial. The lack of compactness follows from 
{U x (a)) xeS z aS £ not having a finite subcover. The space is easily seen to be perfect, since the finite 
intersections of U x (a) form a base, and no such set is a singleton. The space is totally disconnected 
because every U x (a) is clopen (its complement is U&eE-{a} U x (b)), and it is of first category since every 
singleton set is closed and the space is countable. 

As for lack of first-countability, let c G CA, and assume (V,) iG n is a neighborhood basis for c. Since 
(U x (c)) xeI z forms a neighborhood subbasis for c as well, there must exist finite sets X, C E z such that 
ClxeXi U*{ c ) c Vi for all i. If we let X = UieN^' tnen clearly (U x (c)) xe x is already a countable subbase 
for c. Let y £ X, and for each i, let c ; - be the cellular automaton with r(c,) = i which behaves like c, 
except for mapping yr-jj to z(c(y)i). Clearly, c, -ft c since Cj(y)o ^ c(y)o for all i. However, when x^y, 
we have c,- G U x (c) for large enough i, and thus c\ — > c, a contradiction. 

Finally, we prove the normality. Let a,di G CA for all i G N be such that the sets X = {a | i G N} 
and Y = {dj | i G N} are closed and disjoint. Since the space is countable, this covers all the closed 
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normal Hausdorff 

compact 

sequential 

tot. disconnected 

first category 

perfect 


CA with pointwise topology 


yes no no yes yes yes 


26. Arens-Fort space 
35. Q* 
98. Appert space 
99. maximal compact topology 
1 14. single ultrafilter topology 


yes no no yes no no 
no yes yes no yes yes 
yes no no yes no no 
no yes yes yes no no 
yes no no yes no no 



Table 1 : Properties of the countable but not first-countable counterexamples of |[T2l contrasted with our 
space. The space Q* is the one-point compactification of Q. In the table, the spaces which are not normal 
Hausdorff are neither normal nor Hausdorff. The space Q* is in fact connected. The spaces which are 
not perfect do not even contain induced perfect subspaces. 



sets. Inductively for all i G N, we choose clopen neighborhoods £/,• 3 ci,... ,c,- and Vi 3 di,...,di such 
that Ui n Y = Vi HX = Ui (~1 V t = as follows. First, U = V Q = 0. Let C/,_i and V t -\ be defined. Let 
U be a clopen neighborhood of c; such that U DY = 0, and define {/* = U (U — V;_i). Similarly, 
Vi = Vi-i U (V - Ui) for a clopen neighborhood V of d t with VflX = 0. Now, \J t Ui and \J t Vi form 
disjoint open neighborhoods of X and Y, respectively, and this proves normality. 

The lack of sequentiality is seen as follows: The set SUR is dense by Proposition [2j and thus not 
closed, since now SUR C CA = SUR. However, we will prove in Theorem [4] that SUR is closed in an- 
other topology we define in Section [4j and Lemma [4] shows that such sets are sequentially closed in the 
pointwise topology. These results do not depend on this theorem. □ 



Remark 1. In fact, the above proof for normality works as such in any countable space with a clopen 
base. Without countability, we only know that such a space is completely regular. 

We now note a further connection with the Cantor space, namely that the pointwise topology is zero- 
dimensional and perfect. It is known that a compact, perfect, zero-dimensional and metrizable space is 
homeomorphic to the Cantor space. We do not know whether a similar characterization exists for the 
pointwise topology, and whether the 'duality' can be formalized. 

Problem 1. Characterize the pointwise topology on CA using some of its topological properties. 

Finally, we note that one might also define a topology on CA in one of the following ways: c ; - — > c if 
for all x G E z , we have Ci(x) = c(x) for all large enough i, or by defining that c,- — > c if for all w G r°, 
we have q(jc)o = c(x)o for all x G [w]o for large enough i. However, both of these attempts result in the 
discrete topology, as is easily verified. 
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4 Topologizing the CA by measuring preimages: the combinatorial ap- 
proach 

Definition 2. Let c,d G CA, and r a common radius for c and d. Then the difference set of c and d for 

the radius r is 



D c d = {weZ N ^\c{w)rfd{w)}, 
where the CA are taken as word functions c : -»• Z N ( r - r ( c )) and d : -»• E^"^)). The distance 



between c and d is 



8{c,d) = 




We call the space CA with this metric the uniform Bernoulli space. 

The name 'uniform Bernoulli space' comes from the fact that we later define pseudometrics on CA 
using shift-invariant measures, and the pseudometric 8 is given by the uniform Bernoulli measure. The 
value S(c,d) above does not depend on the exact value of r, and it is left implicit in the notation D c d . For 
cod, we always use the radius r(c) + r(d). 

Example 2. The space CA is not totally bounded (and thus not compact): Let {c; G CA | i = 1, . . . ,n} be 
a finite set of automata, and let r = max{r(c,) | i = 1, . . . ,n}. Consider the r + l-shift automaton G r+ . 
Given a word w G T, N ( r+1 ), we have O r+1 (w) ~ c,(w) ijfw r+ \ = c,(wo). Thus 



and CA cannot be covered by a finite number of open (1 — |E| )-balls. 

Example 3. The function (o) : CA 2 — > CA is not continuous. In fact, even the function (d i— >■ doc) is not 
continuous for all c G CA: Le? E = {0, 1} and let c q for q G {0, 1} be the all-q CA defined by c q (x)i = q 
forallxeL z ,ieZ. We let 



Clearly di — > cq, but since dj o c = c\ for all i, d, o cq does not approach cq o cq = cq. 

Here, we used a rather pathological CA as the rightmost argument, and in fact, with suitable restric- 
tions, we do obtain continuity. 

Theorem 2. The restriction of(o)toCkx SUR is continuous. 

Proof. Since CA x SUR is a metric space, it is enough to check that limits of sequences commute with 
(o). So let Cj,c G CA and di,d G SUR with (cj,di) — > (c,d). We need to show c,-od,- — > cod. Let / be large 
enough that 5(c,-,c) < £ and S(di,d) < e, and assume that r(c,) > r(c) and r(di) > r(d). We need an 
upper bound on the size of D c J° d '. 

First, let us give an upper bound for the size of the set A of all words w G £ Ar ( r ( c ;)+ r ( rf ;)) suc h that 
di(w)j / d(w)j for some j G N(c). If w G A, then at least one of v%(^.) +7 - for j G iV(c) must be in D d . 



8{o' 



.r+\ 



,ci) = \-\i\ 




Thus 



|A| < |iV(c)|- D d J - \L\ 2 < Ci \ 



Then, let us give an upper bound for the size of B = D c J° d ' —A. Let w G B, so that (w) N ^ =d(w) N ^ . 
Since w G D c J° d ', we have c,-(J,-(w)) 9^ c(<i(w)) ~ c(J ; -(w)), which implies c,-(J ; -(w)) 9^ c(<i;(w)), and thus 
J/(w) G D^'. Now we have that di(B) C D C J, and since G SUR, the balance property implies 



|S| < \D c ( !\-\L\ 2r{d '\ 



60 



Topology and Cellular Automata 



Of course, 



D' 



cod 



\A \ + which implies 5(c; odj,cod) < (2r(c) +2)e by a direct calculation. 

□ 



The following lemma is proved similarly as the previous one, and shows that composition on the left 
is always continuous. 

Lemma 1. If e G CA, then the function c i— > eoc is (Lipschitz-)continuous in CA. More specifically, 



8{eoc,eod) < (2r(e) + \)-8(c,d) 



for all c,d G CA. 



Proof. Let c,d G CA, and let r = max(r(c),r(c?)). Now if e(c(w)) 76 e(d(w)) for some word w G 
■£N{r+r(e))^ foea necessarily c{w) N ^ ^ d(w) N ( e y Thus we have 



I r> eoc I 



< 



{w^ r+r ^\c{w) N{e) ^d{w) N[e) } 



< \N(e)\- \D d \ -\n Me) . 

The claim again follows by a direct calculation. 

Using the above lemma, we prove that inversion is also continuous. 
Theorem 3. The function 



□ 



-l . 



: REV — > REV is continuous. 

Proof. Let c G REV. We prove that is continuous at c. For that, let d G REV such that r = r(d) > r(c) 
and r 1 = r(d~ l ) > r(c _1 ). 

Let w G L N< < r+ ^ be such that d{w) G Df_\. Then we have C -\d{w)) ^ d-\d(w)) ~ c-\c{w)), 
which implies w G D c c _\° c d . This means that the <i-preimage of D^_', is included in D c c _[° c ,. 

Since d G SUR, we again have D c d _\ ■ |Z| 2r+1 < D c c _\° c d by the balance property. An application of 
Lemma [T] now yields 

8{c-\d- 1 ) <5{c- x o C ,c- x od) < (2r(c- l ) + \)-8{c,d), 
which proves the claim. □ 



The space has no isolated points, and in fact we obtain a large amount of approximation results by 
using the continuity of (o). 

Proposition 3. All ofCk, SUR and REV are perfect as topological spaces. 

Proof. Given CA c, we construct the CA c, with r(ct) = r(c) + i, which are equivalent to c on all words 
, except O^^, on which they disagree with c. Clearly c, — > c, but c,- ^ c holds for all i, and thus 
CA is a perfect space. 

Now consider the cellular automata c, which function as identity maps, except for mapping the central 
cell of 110'aO'll to l{a). It is clear that c,- G REV for all i, since these automata keep the occurrences of 
1 1 untouched, and can only change the central cell between two occurrences. It is also clear that they 
converge to the identity map. 

Now consider an arbitrary surjective CA d. All of the c,- o d are distinct, since d is surjective and 
the Ci are distinct, and by the continuity of (o) restricted to CA x SUR, the sequence c,- o d converges to 
id o d = d. If d is reversible, then so are all a o d. We have obtained that also SUR and REV are perfect as 
topological spaces. □ 
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Theorem 4. SUR is closed. 

Proof. Let c ^ SUR and d G SUR. We show that 8(c,d) is bounded from below by a positive constant 
depending only on c. That is, we will prove there exists e > such that B E (c) n SUR = 0. 

We may assume r(d) > r(c). Since c is not surjective, there exists a word v G E^W for some n such 

that 

|c _1 (v)| >|E| 2r W + l. 

Now let w G E^M^R") be such that c(w) r u) = v. We either have d(w) = v, or Wwd)+i £ D c d for some 
i G N(n). In terms of cardinalities this implies 

\d-\v)\+{2n+\)-\D c d \-\n 2n 
>\c-\v)\-{\L\^-^) 
> (|£| 2r M + l)(|£| 2 M^-'"( c »), 

and since j ^ 1 (v) | = |E| 2r ^ by the balance property, it follows that \Df.\ > (2n + ■ \'L\ 2( - r( - d ^ r ^~ n \ 
But then 8{c,d) > n~ l ■ |£|~ 2 ( r ( c )+«) ; an d the claim is proved. □ 

In fact, by a result in |[T0l , one can choose n < |£| 2 '' c ' in the previous proof, yielding a bound 

8(c,d) > |E| 



-2{r(c)-\T.\ 2r[c) ) 



Like surjectivity and inversion, commuting with a fixed cellular automaton is another intuitively 
'one- step' property, and thus should behave well in our topology. Using the continuity of o, we indeed 
verify this intuition, and prove that the limit of CA commuting with a given surjective CA also commutes 
with the CA. 

Example 4. The commutator of a surjective CA c is closed. Consider the function d i— > 8{cod,doc). 
Since composition from both left and right by a surjective cellular automaton is continuous and the 
metric is a continuous function CA 2 — > R, we obtain that the preimage ofO is a closed set. But obviously 
this is just the commutator. 

Example 5. For each p>l, the set of p-periodic CA is closed. Namely, if Ci — > c and cf = id for all i, 
then by the continuity of composition on REV we have c p = id. 

As opposed to the above example and the case of SUR, the set REV of reversible cellular automata is 
not a closed subset of the uniform Bernoulli space. 
Example 6. The set REV is not closed if\L\ > 3. Let c be the CA 



c(x)j 



Xj +xj-i mod 2, ifxj G {0, 1} 
2, otherwise 



That is, c is the XOR-with-left-neighbor automaton, which addionally fixes 2s and interprets them as 0s. 
We let Cj behave as c, except for fixing the current cell if they see no 2's in the neighborhood N {i). Since 
pv(7)j — > 0, we see that c\ — > c. 

Of course, c is not in REV, since °°0°° and °°1°° have the same image. All of cu however, are in REV: 
Let Ci(x) = Cj(y) for some x^y. Since 2 's are never created or destroyed, x and y have 2 's in the same 
coordinates, and thus for both x and y, the set of coordinates that are fixed when applying XOR are equal. 
In every point, a fixed coordinate must occur at least every \N(i)\ steps, since if no 2's occur in a block 
of size \N(i)\, the middle cell is fixed. But these two facts clearly imply x andy are equal, which proves 
the claim. 
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We show that the uniform Bernoulli space is not complete either, which strengthens the intuition 
given in Example [2] that sequences of cellular automata quite rarely converge to anything (as one might 
expect). 

Example 7. The space CA is not complete. We will show this by constructing a Cauchy sequence (c,) in 
CA without a limit. Start with the identity CA c\ and for each i > 1, define Cj recursively as follows: Let 
(n/) be a sequence in N with nt > i and the property that 

k-l i 

£ \Z\- n '+j < -"-.|i|-£/=i"< (1) 
;=i 2 

for all j, k G N with j < k. This is the case, if (n,-) grows fast enough. 
Let rj = r(a) = Y!k=i n k> and for all words w G T, N ( r '\ define 

c Aw) = i ^-iM )' i f w lr(ci)-n i +lAc i )] = ° ni 
\ c,-_i(w)o, otherwise. 

Then 5(c,-,c, + i) = < |£| ~'/or all i, and by the triangle inequality 

k-l k-l 



5(c/,ci +t ) < 5(c i+ ;,ci+;+i) < |Z| '52 I 1 ! ; < 



7=0 7=0 



l-in 



-1 • 



so the sequence (c ( ) is Cauchy. 

Moreover, for each i,k G N w/?/z i < k, equation ([T]) implies that 

k-l 

8(ci,c i+k ) > 5(c/,c i+ i) - 52 5(ci + y,Ci +J - + i) 

7=1 

= |Ep ni - £ isr ni+j ' > - • |Ep" f , 

7=1 2 

so none of the terms c, can »e a limit for the sequence. 

Consider then an arbitrary CA c not in the sequence (c,-). We may choose it to have the radius r(c,) = 
LJt=i n kfor some large enough i. We show that c is not a limit for (c,). First, note that 8(c,Cj) > |£p^ c '' 
holds. But now equation ([T]) implies that 

k-l 

S(c,c i+k ) > 8(c,ci) - 52 S(c i+ j,c i+ j + i) 

7=0 

k-l i 
7=1 2 

/or a// k > /. TrtMS c is not a limit for (c,). 

Finally, let us briefly discuss the homomorphisms of Q and (2), and some dynamical notions, in the 
uniform Bernoulli space and the pointwise topology. 

Definition 3. Let r be a radius ofc£ REV, and let 

R c = {(X[ 0j 2r_i],c(x)[_ r) ,_i]) | X G 
IT? I 

ffte sc? of right stairs of c. We define h + {c) = lyj^ G Q. 
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The map h + : REV — > Q is in fact a group homomorphism from (REV, o) to (Q>o, •)> and it compares 
the information flows to the left and to the right in the evolution given by an automaton Q. We show by 
example that this function is not continuous, even though its definition is concerned with only one step. 
Example 8. The morphism h + is not continuous at least with the alphabet £ = {0, 1} x {0, 1}. First, it 
is easy to see that for all i, there exists a length kj and a finite set of words Wj C T. k ' with \Wj\ < \\^\ k ', 
such that Wi defines an empty SFT when taken as the set of forbidden patterns. For y G {0, 1} Z , define 
Jiiy) = {j G 7L | yyj+ki-i} G Wi}. Clearly, there must exist n such that Z — Ji(y) does not contain an 
interval of length nfor any y, that is, coordinates j such that yyj+ki-1] £ Wi must occur with bounded 
gaps. For j G Jiiy), we define Pi(j,y) as largest f G Jiiy) such that f < j. 

We now define 

c-(x) ■ = { *>' ifj£Ji(ni( x )) 
,y h \ (n l (x)j,n 2 (x) f ), ifjeJi(m{x)), 

where j' = Pi(j, K\ (x)). That is, we shift information to the right at the positions marked by Wi. 
From S| — > 0, it clearly follows that ci — > id. It is also easy to see that h + (ci) = 2 for all i. 

We note that there is a more well-known homomorphism defined on (REV,o) called the gyration 
function, defined in [2]. Since this function is based on behavior on periodic points, it is very easy to 
find a counterexample for continuity in the uniform Bernoulli space, and a proof of continuity in the 
pointwise topology, when the codomain nr=i Z/nZ is given the product topology. 

The previous example also shows that sensitive cellular automata are not a closed set, as each c ( is 
sensitive, but their limit is not. 
Example 9. If c G CA, define the entropy of c as 

h(c) = hm hm — -, 

r— >°of— >°° t 

where N c (r,t) is the number of different r x t-rectangles in all spacetime diagrams of c. The entropy 
function is not continuous in general, even in REV: Let £ = {0, 1} , and define the CA c ; - as the identity 
CA, except that maximal patterns of the form 

a\ /oy'ifl 2 foy'i /oy'ia„ 

where n>2 and the aj,bj G Z are not part of any word ({]) ' }, are rotated to 

b\ foy'iai /oy'i /0y'la„_i 

b 2 \0J lb 3 \0) 1 ••• \0J la n ■ 

This is clearly doable with a local rule that checks its surroundings for the marker patterns, and the 
resulting CA are reversible with Ci — > id. Now given a word w G S r and t G N, we append the word 

/oy'ia toy \a /oy'i /oy'ia 
\q) ib, \o) ib 2 \o) i ••• [a) tfc t _! 

to w, producing in the spacetime diagram an r x t-rectangle with w on the bottom and b\b2 ■ ■ - b t -\ on 
the lower track of the right border. Thus we have that N Ci (r,t) > 4 r • 2 f ~\ so 

u r rlog4 + (f-l)lo g 2 

h(ci) > hm hm = log2. 

r— ><x>f— >■» t 

But the identity CA has zero entropy: h(id) = 0. 

We conclude this section with the following two open problems. 
Question 1. Is h + : REV —> Q continuous when REV has the pointwise topology? 
Conjecture 1. Transitive CA are not closed in the uniform Bernoulli space. 
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5 Topologizing the CA by measuring preimages: the measure theoretic 
approach 

Definition 4. Let ji. be a measure on Z z . We define the pseudometric 

8»(c,d) = n(iD c d ]o)- 

In this way each measure fX gives a topology for CA. We denote a A c if a — > c holds in this topology. 
Remark 2. The pseudometric 8^ is a metric iff fx has full support. 
It is easy to see that c, A c holds iff 

ri(c-\C)Ac7\ C ))^0 

holds for all clopen sets C C L z . As we noted in the previous section, the pseudometric defined by the 
uniform Bernoulli measure coincides with that of the uniform Bernoulli space. 

We now finish the proof of non-sequentiality in Theorem [T] The result follows from the following 
connection between the difference set topologies and the pointwise topology. Note that since the point- 
wise topology is not sequential, Proposition [4] does not imply that it would be finer than the difference 
set topologies. 

Lemma 2. Let ji.be a measure on and let X\,X2, ... be a countably infinite family ofBorel sets with 
jU(X,-) > £ for some £ > 0. Let X be the set of points x 6 £ z which appear in infinitely many of the X,-. 
Then fi(X) > £. 

Proposition 4. Let il be a measure in £ z . If Ci — > c in the pointwise topology, then 5^(c;,c) — > 0. 

Proof. Assume the contrary. Without loss of generality, there then exists a positive £ with S^(ci,c) > £ 
for all /. Denoting Xj = [D c c .]o, the conditions of the above lemma are satisfied, and the set 

X = {xeL z \ Cj(x)o ^ c(jc)o for infinitely many i] 

has /J. -measure at least £. In particular, X is nonempty, which contradicts the fact that c,- — > c in the 
pointwise topology. □ 

The following result shows that the difference set topologies are quite numerous and varied in form. 

Theorem 5. If il and V are measures on £ z , pL / v, and pL is ergodic, then }i and V induce distinct 
topologies on CA. 

Proof. First, it is necessarily true that jj, ^> v, so we have a measurable set BCI Z such that ju(fi) = 
and v(B) = q > 0. By regularity, for every i £ N we find an open set t/,- 5 B such that ju(f/;) < i~ l and 
v(Ui) < q + i~ l . We can suppose that Ui + \ C Ui for all i. 

For every i we have an ascending chain of clopen sets Cj C Cf C . . . C Ui such that [/, — Cf \ 0. 
From this it follows that v(Cf) — > v(f/;). Thus, for all / G N, we can take a number Mi £ N such that 
v(Ci) > q-T l holds for the set C,- = cf'. If j > i, we also have that 

v(C,nCy) > v(Q) + v(C ; -)-v(C/i) >q-3r\ 

Furthermore, it is clear that jLl(Q) < ju(f/;) < i -1 . 
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Let Wj C be a finite set of words such that C, = [W,-]o- Define a sequence in CA as follows. The 
CA Ci acts as the identity except on words of W,-, on which it applies the permutation z to the central cell. 
Now Df t = Wi, so that 

5"(id,Ci) = M(Ci)-X> 

but 

d v {id,c i ) = v(Q)^q>0. 
Thus Ci A id holds, but c, A id does not, which proves the claim. □ 

Corollary 1. All Bernoulli measures induce distinct topologies on CA. No other shift invariant measure 
can induce these topologies. 

We now reprove most of the combinatorial results of Section [4] in the measure theoretic framework. 
In some cases, the proofs become somewhat shorter and cleaner. 

Definition 5. Let ji. be a measure on £ z . Then PRES^ denotes the set of ^.-preserving CA, that is, 

PRESjj = {c G CA | n(c~ l (B)) = n(B) for all ^-measurable B C 

The following theorems generalize the results for the uniform Bernoulli measure pt, and are proved 
using the same ideas. Note that in this case, SUR = PRES^. 

Theorem 6. Let ji.be a measure on £ z . The restriction o/(o) toCAx PRES^ is continuous with respect 
to S^. 

Proof. Let c, G CA and d\ G PRES^ for all i, and c; A c, d[ A d. We will prove that c, o d{ A c o d. 
For that, let C C Z z be a clopen set. Denote 

A i = dr\cr 1 (C))Adr\c- 1 (C)), 

the <i ( -preimage of the set where c, and c differ w.r.t. C, and 

Bi = dr 1 (<T 1 (C) ) Ad~ 1 ( c - 1 (C) ) , 

the set where J,- and J differ w.r.t c~ ! (C). It is clear that 

dr 1 ( c r 1 (C) ) A dr 1 (c~ 1 (C) ) c A,- u B, . 

Since J,- preserves jU and c _1 (C) is a clopen set, we have that 

A t(A,) = M(cr 1 (C)Ac- 1 (C))^0 

and 

At (Bi) =n(d- 1 (c- 1 (C) ) Adf 1 ( c - 1 (C) ) ) -> 0, 
from which the claim then follows. □ 

Theorem 1. If \l is a measure and d G CA, then c^f doc is continuous with respect to 8^. 
Proof. Let c, A c and C a clopen set. Since d~ 1 (C) is also clopen, we have 

uddoay^QAidocy^c)) 
= n(c; 1 {d- l (c))Ac- i (d- l (c)))^o. 

□ 
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Theorem 8. If il is a measure, then (•) 1 is continuous on PRES^ PlREV. 

Proof. Let a A c in PRES^ n REV, and C a clopen set. Clearly, also cj l G PRES^, for all i. Denote 
D = c(C), which is also clopen. Now 

= il( Ci (C)Ac(C)) 

= il( Ci (c- l (D))AD) 

= ii(c-\D)Ac7\D))^0. 

□ 

Theorem 9. Let p, be a measure. Then PRES^ is closed in CA^. 

Proof. Let c £ PRES^, . Now we have a measurable set B such that p(c~ l (B)) > p(B). Since p is regular, 
we can find an open set U 2 B such that p(U) < /x(c _1 (fi)) < ju(c _1 (C/)). Now £/ can in turn be 
approximated with clopen sets C\ C C2 C . . . C f/ such that U — Cj\ 0, which implies jll(C/) — )■ ju(C/) 
and/i(c _1 (C/)) ->■ /i(c _1 (E/)). Take 7 large enough that ju (Cy) < p(U) < p(c~ l (Cj)) and denote C = Cj. 

Suppose then that q G PRES m for all i G N, and c, A c. Now in particular p(cj l (C)) ->■ jU(c _1 (C)) > 
ju(C). But this contradicts the assumption q G PRES^. □ 

Next, we show how the difference set spaces can also be obtained by integrating different distance 
functions over the space Z z . In the case of the Besicovitch pseudometric (defined below), we have an 
exact correspondence with the respective 8^ . 

Theorem 10. Let p be a shift invariant measure, and define, for all c,d G CA, 

8fl(c,d) = / dc(c(x),d(x)) dp(x). 

Then 8^ and 8^ are uniformly equivalent pseudometrics. 
Proof. First, note that 

8g(c,d) > [ d c (c(x),d(x)) dp{x) > n{[D c d } ). 

Second, we have 




□ 

Definition 6. Let x,y G £ z . The Besicovitch distance of x and y is 

1 / \ y \{\A <n\xi^yi}\ 

d B (x,y) = hmsup — — , 

n _^oo zn -+- 1 

that is, the limit superior of the density of their differences as the size of the observation window grows 
without bound. 
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The Besicovitch distance is a pseudometric and defines a topology on £ different from the usual 
Cantor topology. The following theorem gives a connection between the pseudometrics 8^ and ds- 

Theorem 11. Let \ibe a shift invariant measure, and define, for all c,d G CA, 

Sn(c,d)= / dB(c(x),d(x)) dn(x). 

Then we have 8^ = 8g. 

Proof. Let c,d <E CA with common radius r. Now the function l[zy] is /^-integrable, so by the ergodic 
theorem we have that 

1 - 

lfnci fx) = Km V Imnfx) 

i——n 

= Hm \{i€N(n)\x N{r)+i €iy d }\ 
«-><*> 2n + 1 

is defined for -almost all x, and 

The claim directly follows, since now 1* DC ] (jc) = dB(c(x),d(x)) for /i-almost all x G £ z . □ 

6 Future Work 

In the future, it would be interesting to consider connections between dynamical notions such as transi- 
tivity, mixingness, sensitivity and entropy by proving that under some additional natural constraints on 
the topologies considered, closedness of sets of cellular automata with a (possibly parametrized) dynam- 
ical property, and continuity of certain dynamical invariants automatically imply other closedness and 
continuity results in any topology. Such results, if any can be proved, could perhaps imply (or inspire) 
interesting new connections outside our topology framework between dynamical notions. We are also 
interested in whether entropy can be made continuous with a natural topology based on the long-term 
behavior of cellular automata. 

We would also like to study the sequentialization of the pointwise topology, the topology whose 
closed sets are exactly the sequentially closed sets of the pointwise topology. This is easily seen to 
define a sequential topology. This topology is finer than the pointwise topology and any of the difference 
set topologies, and both REV and SUR are closed sets (by Proposition [T] and Theorem [4| respectively). 
However, it can be shown that the space is still perfect. 
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